CDS

Accession Number TCMCG005C21950
gbkey CDS
Protein Id XP_020269441.1
Location join(75749561..75749745,75749747..75749906,75751632..75751852,75751854..75751929,75752499..75752651,75759519..75759643,75759747..75759842,75761000..75761240)
Gene LOC109844740
GeneID 109844740
Organism Asparagus officinalis
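
The Location field above uses the GenBank-style `join(start..end,...)` notation, where each span is a 1-based, inclusive coordinate range. As a minimal sketch (the helper names `parse_join` and `span_lengths` are illustrative and not part of any database tooling), the spans can be parsed and their lengths summed like this:

```python
import re

# Location string copied verbatim from the record above.
LOCATION = (
    "join(75749561..75749745,75749747..75749906,75751632..75751852,"
    "75751854..75751929,75752499..75752651,75759519..75759643,"
    "75759747..75759842,75761000..75761240)"
)

def parse_join(location):
    """Return (start, end) tuples for every 'start..end' span in the string."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]

def span_lengths(segments):
    """Length of each span; coordinates are treated as 1-based and inclusive."""
    return [end - start + 1 for start, end in segments]

segments = parse_join(LOCATION)
lengths = span_lengths(segments)
total = sum(lengths)

print(len(segments), "spans")
print(lengths)
print(total, "genomic bases covered")
```

The spans cover 1257 bases in total. The first two spans, and the third and fourth, are separated by a single skipped base (75749746 and 75751853). The sketch assumes a plain forward-strand `join` with no `complement(...)` wrapper or partial-end markers (`<`, `>`), which holds for this record but not for location strings in general.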

Protein

Length 420aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA376608;
db_source XM_020413852.1
Definition LOW QUALITY PROTEIN: histone-binding protein MSI1 homolog

EGGNOG-MAPPER Annotation

COG_category B
Description WD40 repeats
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko00001, ko03036
KEGG_ko ko:K10752
EC -
KEGG_Pathway ko04218, map04218
GOs -

Sequence

CDS:  
ATGTCGAAATCCGAAGAAGACTCTAAACCCGCCGCCGCCGATGATCGCCGGCTCGACGAAGAGTACAAGATCTGGAAGCAGAACACGCCGTTCATGTACACCTCGTCATCACCGACGTCTCGAGTGCCCTCGATGACGGTCTCAGGCTGCCGGAGCGAGCGGAGGCTCGCCGGAAAAGACTACTCCCACAAGCTGATCCTGGGGACCTACGCCGGTGACGAGCGTAACTATCTCATGATCGCCGAGGTCCAGCTCCCGATCGAGGACGCTGAGAGGCAAATGAGGGTCTTTGACGTCGATCACGGCGAGATCGGAGTGTTTGAAGGCGCTAGAGGCAAAAGCAAGGTAAAAGTAATTCAGCAGATAAAACACGAAGGAGAAGTTAATCGAGCTCGTTACATGCCTCAGAATTCTTCCCTTATTGCAACAAAGACAGTTAGTGCAGAGGTGCATGTATTTGATTACAGCAAGCATCCTTACAACCCTCCTATAAGCGGTGGTGAATGCAATCCTGATTTGAGGTTGAAGGGCCACAGCTCTGAAGGATATGGTTTATCATGGAGTCATGATGCTCAAATATGCTTGTGGGACATTAACGCAGCACCCATTAATAAGTCTCTTTGGCCTCTTAGAAGCTTTAAGGTAAATGAAGATGCTGTTGAGGATGTTGCGTGGCATTTGAGGAATGAATACTTGTTTGGCTCAGTTGGTGGTGATCAACACTTGGTCATATGGGATATTAGAGCACAAACAACTGACAAGCCGATTCAGTCTGTATTTGCTCATCGAGATGTGGTTAATTGCTTGGCATTCAATCCTGCCAATGAGTGGCTTGTAGCAACAGGTTCAGCTGATAAGACTGTTAAGTTGTTTGACCTCCGCAAACTAAGCACTTCTCTTTATGCCTTTAATTATCACAAGGAAGAAGTTTTCCAAGTGGGATGGAGTCCAAAAAAGGAGTCGATACTAGCATCTTGCTGTGCTGGTAGGAGGATTTTAGTGTGGGATTATAGCAGGATTGGCGATGAACAGGACCCAGAGGATGTAGAAGATGGTCCACCGGAACTTTTGTTCATACACGGCGGTCACACTACCAAGATATCCGATTTTTCTTGGAACCCTTACGATGAGTGGGTGATTGCTAGCGTCGCTGATGATAACATACTTCAAGTATGGCAGATGGCTGAGAATCTTTACTACAGTCACGGTGATAAACTACCACCTGATGAACCTTCATCACCTAGAACACCCACTTAA
Protein:  
MSKSEEDSKPAAADDRRLDEEYKIWKQNTPFMYTSSSPTSRVPSMTVSGCRSERRLAGKDYSXHKLILGTYAGDERNYLMIAEVQLPIEDAERQMRVFDVDHGEIGVFEGARGKSKVKVIQQIKHEGEVNRARYMPQNSSLIATKTVSAEVHVFDYSKHPYNPPISGGECNPDLRLKGHSSEGYGLSWSHXDAQICLWDINAAPINKSLWPLRSFKVNEDAVEDVAWHLRNEYLFGSVGGDQHLVIWDIRAQTTDKPIQSVFAHRDVVNCLAFNPANEWLVATGSADKTVKLFDLRKLSTSLYAFNYHKEEVFQVGWSPKKESILASCCAGRRILVWDYSRIGDEQDPEDVEDGPPELLFIHGGHTTKISDFSWNPYDEWVIASVADDNILQVWQMAENLYYSHGDKLPPDEPSSPRTPT